AWS S3 Glacier Storage: Guide to Cold Data Archiving
What is AWS S3 Glacier Storage? Your Guide to Low-Cost Cold Data Archiving
Does your business store terabytes of old customer records, raw media files, or compliance data that you rarely touch but can’t delete? Paying for high-performance hot storage for this “cold” data is a massive waste of budget. AWS S3 Glacier Storage is Amazon’s purpose-built solution for exactly this problem: low-cost, durable storage for data you access infrequently, but need to keep long-term.
What is AWS S3 Glacier Storage?
AWS S3 Glacier Storage is a cloud-based cold storage service part of the Amazon S3 (Simple Storage Service) ecosystem. It’s designed for data that is accessed once a year or less, with storage costs up to 90% lower than standard S3 hot storage tiers.
All Glacier storage classes offer 99.999999999% (11 9’s) data durability, meaning your archived data is replicated across multiple AWS availability zones to prevent loss. It’s fully integrated with other AWS services, so you can automate data archiving, retrieval, and lifecycle management without custom tooling.
How S3 Glacier Fits Into the S3 Storage Ecosystem
Amazon S3 offers multiple storage classes tailored to data access patterns: S3 Standard for frequently accessed data, S3 Standard-IA for infrequently accessed hot data, and the S3 Glacier family for cold, archival data. Glacier is not a separate service—it’s a set of storage classes you can assign to S3 objects directly or via automated lifecycle policies.
AWS S3 Glacier Storage Classes
The S3 Glacier family includes three specialized storage classes, each optimized for different access needs and retrieval times:
S3 Glacier Instant Retrieval
Best for cold data that needs millisecond access when requested, such as old media files or compliance records you may need to pull quickly. Retrieval is immediate, with no waiting period. Storage costs are ~70% lower than S3 Standard-IA.
Minimum storage duration: 90 days. Early deletion fees apply if you delete data before 90 days.
S3 Glacier Flexible Retrieval (Formerly Amazon S3 Glacier)
Designed for data accessed 1-2 times per year, with flexible retrieval options ranging from minutes to hours. You can choose expedited (1-5 minutes), standard (3-5 hours), or bulk (5-12 hours) retrieval, depending on your urgency and budget.
Minimum storage duration: 90 days. Ideal for backup data, disaster recovery archives, and long-term media archiving.
S3 Glacier Deep Archive
The lowest-cost storage option in AWS, designed for data accessed once a year or less, such as regulatory archives that only need to be pulled for audits. Retrieval times range from 12-48 hours.
Minimum storage duration: 180 days. It’s up to 95% cheaper than S3 Standard, making it the most cost-effective option for true cold data.
Key Benefits of AWS S3 Glacier Storage
- Unmatched cost efficiency: Pay a fraction of hot storage costs for data you rarely access, with no upfront hardware or maintenance fees.
- Enterprise-grade durability: 11 9’s of data durability ensures your archived data is safe from hardware failure or outages.
- Flexible retrieval options: Choose retrieval speeds that balance cost and urgency, from milliseconds to 48 hours.
- Compliance ready: Meets regulatory requirements for HIPAA, GDPR, PCI-DSS, and more, with audit logs and access controls.
- Seamless S3 integration: Use S3 lifecycle policies to automatically move aging data to Glacier classes, no manual intervention needed.
Common Use Cases for AWS S3 Glacier
Most businesses use S3 Glacier for:
- Regulatory compliance archiving (financial records, healthcare data, tax documents)
- Media and entertainment asset archiving (raw footage, old video files, completed project assets)
- Disaster recovery backups for on-premise or cloud workloads
- Scientific and research data (large datasets accessed infrequently)
- Application log archiving for troubleshooting or audit purposes
AWS S3 Glacier Pricing: How Much Does It Cost?
S3 Glacier pricing is based on four factors:
- Storage volume (per GB per month)
- Retrieval requests (per GB retrieved)
- Retrieval speed (expedited retrieval costs more than bulk)
- Early deletion fees (if you delete data before the minimum storage duration)
For example, as of 2024, S3 Glacier Deep Archive costs $0.00099 per GB per month in the US East (N. Virginia) region—meaning 1TB of storage costs just ~$1 per month. Always check the AWS official S3 pricing page for the latest regional rates.
Pro tip: Use S3 Intelligent-Tiering if you’re unsure of your data access patterns—it will automatically move data to the cheapest applicable tier, including Glacier classes, with no retrieval fees.
How to Get Started with AWS S3 Glacier
Migrating to S3 Glacier takes just a few steps:
- Log into the AWS Management Console and navigate to the S3 service.
- Create a new S3 bucket or select an existing one for your archived data.
- Set up S3 lifecycle policies to automatically move objects to your chosen Glacier class after a set period (e.g., move data to Glacier Deep Archive after 1 year).
- For existing data, use the S3 console, AWS CLI, or SDK to manually change the storage class of objects to a Glacier tier.
- Configure retrieval permissions to control who can access archived data in your organization.
For a deeper dive into S3 lifecycle policies, refer to our upcoming guide to AWS S3 Storage Classes and Lifecycle Management.
FAQ: AWS S3 Glacier Storage
Q: What’s the difference between S3 Glacier Flexible Retrieval and Deep Archive?
A: Flexible Retrieval is for data accessed 1-2 times a year with retrieval times from minutes to hours, while Deep Archive is for data accessed once a year or less with 12-48 hour retrieval times. Deep Archive is ~40% cheaper than Flexible Retrieval.
Q: How long does it take to retrieve data from AWS S3 Glacier?
A: Retrieval times vary by class: Instant Retrieval is milliseconds, Flexible Retrieval ranges from 1 minute (expedited) to 12 hours (bulk), and Deep Archive takes 12-48 hours.
Q: Can I move existing S3 data to Glacier storage classes?
A: Yes! You can change the storage class of existing S3 objects via the console, CLI, or lifecycle policies. Lifecycle policies are the easiest way to automate this for large datasets.
Q: Is AWS S3 Glacier compliant with data privacy regulations?
A: Yes, S3 Glacier supports compliance with GDPR, HIPAA, PCI-DSS, and other major regulations, with features like encryption at rest, access logging, and audit trails.
AWS S3 Glacier Storage is the gold standard for cost-effective cold data archiving, offering unmatched durability and flexibility for businesses of all sizes. By matching your data access patterns to the right Glacier storage class, you can cut storage costs by up to 90% without sacrificing data availability.
Ready to start saving on cold storage costs? Sign up for the AWS Free Tier today to test S3 Glacier with 5GB of free storage for 12 months. Have questions about choosing the right Glacier class for your workload? Drop them in the comments below!
Comments are closed, but trackbacks and pingbacks are open.